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ABSTRACT 

We present a method for extracting the expected cosmological 21-cin signal from 
the epoch of reionization, taking into account contaminating radiations and random 
instrumental noise. The method is based on the maximum a-posteriori probability 
(MAP) formalism and employs the coherence of the contaminating radiation along 
the line-of-sight and the three-dimensional correlations of the cosmological signal. We 
test the method using a detailed and comprehensive modeling of the cosmological 
21-cm signal and the contaminating radiation. The signal is obtained using a high 
resolution N-body simulation where the gas is assumed to trace the dark matter and 
is reionized by stellar radiation computed from semi-analytic galaxy formation recipes. 
We model contaminations to the cosmological signal from synchrotron and free-free 
galactic foregrounds and extragalactic sources including active galactic nuclei, radio 
haloes and relics, synchrotron and free- free emission from star forming galaxies, and 
free-free emission from dark matter haloes and the intergalactic medium. We provide 
tests of the reconstruction method for several rms values of instrumental noise from 
CTp, = 1 to 250 mK. For low instrumental noise, the recovered signal, along individual 
lines-of-sight, fits the true cosmological signal with a mean rms difference of dims ~ 
1.7 ± 0.6 for — 1 mK, and c?rms ~ 4.2 ± 0.4 for cr„ = 5 mK. The one-dimensional 
power spectrum is nicely reconstructed for all values of considered here, while the 
reconstruction of the two-dimensional power spectrum and the Minkowski functionals 
is good only for noise levels of the order of few mK. 
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1 INTRODUCTION 

The Wilkinson Microwave Anisotropy Probe (WMAP, 
Spergel et al. 2007) polarization measurement of the cos- 
mic microwave background (CMB) indicates a value of r = 
0.089 ± 0.030 for the optical depth for Thomson scattering 
with intervening free electrons in the intergalactic medium 
(IGM). This value for the optical depth implies the pres- 
ence of ionizing radiation at redshifts z > 10 when the Uni- 
verse is less than half a Gyr old (see Meiksin 2007 for a 
review). The ionizing radiation could be of varying nature 
depending on the type of available sources as a function 
of time. X-ray radiation emitted from mini black holes (Ri- 
cotti & Ostriker 2004) could be responsible for an early stage 
of (pre)-reionization. Because of their large mean free path 
for photo-ionization. X-rays tend to maintain the IGM at 
only partial reionization although full reionization is possi- 
ble near the sources (Zaroubi et al. 2007; Thomas & Zaroubi 
2007). This possible X-ray pre-reionization is followed by UV 
reionization which begins by the formation of individual H II 



bubbles embedded the neutral medium. At a later stage H II 
bubbles overlap until complete reionization is obtained. 

The CMB polarization measurement is proportional to 
the line-of-sight integral of the ionized fraction and, there- 
fore, is incapable of constraining the details of the reion- 
ization process. Absorption features in the spectra of high 
redshift quasars offer an alternative probe of reionization. 
However, the dearth of quasars at the relevant redshifts lim- 
its the applicability of this probe. Currently, a great deal 
of effort is being made at observing the 21-cm signal from 
neutral hydrogen from high redshifts. If the spin tempera- 
ture of the high redshift H I is different from the CMB then 
redshifted 21-cm radiation could be detected by radio obser- 
vations at the relevant frequency range (around 130 MHz for 
gas at 2: ~ 10). Maps of this cosmological signal constrain 
the three-dimensional (3D) distribution of H I in the Uni- 
verse, and, therefore, they could prove to be the most useful 
probe of reionization. These maps will contain 3D informa- 
tion on the distribution of H I and the ionized fraction as 
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a function of time. Hopefully, in the near future radio tele- 
scopes like the Low Frequency Array (LOFAR),*^ the Preci- 
sion Array to Probe Epoch of Roionization (PAPER), ^ the 
Mileura Widefield Array (MWA), ^ the 21 Centimeter Ar- 
ray (21CMA, formerly PaST),§ and the Square Kilometer 
Array (SKA), ^ will be able to provide such 3D maps. 

One of the main obstacles in extracting cosrnological 
information from redshifted 21 cm maps is the subtrac- 
tion of non-cosmological contaminations due to instrumen- 
tal noise, ionospheric distortion, galactic and extragalactic 
foregrounds. In the last few years an effort was made to 
model the galactic foregrounds (e.g. Jelic et al. 2008; San- 
tos, Cooray & Knox 2005; Wang et al. 2006; Zaldaxriaga, 
Furlanetto & Hernquist 2004), and the extragalactic fore- 
grounds (e.g. Cooray & Furlanetto 2004; Di Matteo et al. 
2002; Di Matteo, Ciardi & Miniati 2004; Jelic et al. 2008; 
Oh & Mack 2003; Santos, Cooray & Knox 2005). In the first 
part of this work we model the galactic and extragalactic 
foregrounds and produce 3D maps of the extragalactic fore- 
grounds. 

The question of removing the foregrounds has been dis- 
cussed before. Most of the previous papers have focused 
on the angular power spectrum of the signal and assumed 
rather limited spectral resolution (e.g. Di Matteo et al. 2002; 
Di Matteo, Ciardi & Miniati 2004; Oh & Mack 2003; Santos, 
Cooray & Knox 2005; Zaldarriaga, Furlanetto & Hernquist 
2004; Morales & Hewitt 2004; Morales, Bowman & Hewitt 
2006). In general subtraction methods that have been sug- 
gested so far includes three stages: bright source removal 
(e.g. Di Matteo et al. 2002; Di Matteo, Ciardi & Miniati 
2004), spectral fitting (Jelic et al. 2008; Zaldarriaga, Furlan- 
etto & Hernquist 2004; Santos, Cooray & Knox 2005; Wang 
et al. 2006), and residual error subtraction (Morales & He- 
witt 2004; Morales, Bowman & Hewitt 2006). Using the fact 
that the foregrounds which contaminate the 21-cm cosmo- 
logical signal have much smoother frequency spectra, Wang 
et al. 2006 developed a method to remove the foregrounds 
along the line-of-sight. Based on the maximum a-posteriori 
probability (MAP) formalism we develop a method to recon- 
struct the 3D maps of the 21-cm cosrnological signal from 
the contaminated noisy data. This method also relies on the 
smoothness of the contaminating radiation and further as- 
sumes a prior for the correlation properties of the cosmologi- 
cal signal. On the other hand, in this method bright sources 
does not get any special treatment so no empty holes are 
left in the map. The random noise is also included in the 
method so there is no need for the three stages framework. 

The paper is organized as follows. In § 2, we briefly 
review the physics behind the 21-cm line. A brief descrip- 
tion of the simulation appears in § 3. In § 4 we present our 
models for the different foregrounds, and in § 5 we describe 
the method for the cosmological signal reconstruction. We 
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conclude with a summary and a discussion of the results in 
§6. 



2 THE 21-CM COSMOLOGICAL SIGNAL 

The 21-cm line is produced in the transition between the 
triplet and singlet sub-levels of the hyperfine structure of 
the ground level of neutral hydrogen atoms. This wavelength 
corresponds to a frequency of 1420 MHz and a temperature 
of T* = 0.068 K. The spin temperature, Tb, is defined ac- 
cording to the relative population of the triplet, , to the 
singlet, rig , sub-levels 

^=3exp(-^). (1) 

An H I region would be visible against the CMB either in 
absorption if Tg < T^fj^^^ or emission if % > T^mb' where 
T^^B ^ 2.73(1 + 2:) K is the CMB temperature. There are 
various mechanisms for raising Ts significantly above T^^^g 
during the epoch of reionization and hence a significant cos- 
mological 21-cm signal is expected. 

Intensities, /(i'), at radio frequency are expressed 
in terms of brightness temperature, defined Eis Tb = 
I{v)(? / (2feg v^) , where c is the speed of light and fc^ is Boltz- 
mann's constant. The differential brightness temperature of 
the 21-cm line emission against the CMB of a small patch 
of gas at redshift z is (e.g. Ciardi & Madau 2003) 

,5Tb = 16mK x„ i(l + 5) (l - 

^002) Kt) (a:)J ' (2) 

where x-^ j is the fraction of H I in the patch, and 5 — p/p— 1 
is the density contrast of the gas. In this paper we assume 
Ts > T^MB (e-g- Madau, Meiksin & Rees 1997; Nusser 
2005a; Chuzhoy & Zheng 2007). In the top panel of Fig. 
1 wc present a typical cosmological 21-cm signal along one 
line-of-sight from z = 11 {1^ ^ 120 MHz) to z = 7 (:/ « 180 
MHz) where the gas in the IGM is completely ionized in our 
simulation (see § 3) . We also calculated the mean differential 
brightness temperature, (5Tb) , for this line-of-sight (dashed 
line) 11. In the bottom panel we show the mean ionization 
fraction, /ion of the simulation box. As expected (5Tb) de- 
creases as /ion increases. 



3 THE SIMULATION 

In order to produce 3D maps of the cosmological 21-cm sig- 
nal and the foregrounds, we use high resolution N-body sim- 
ulation where the gas is assumed to trace the dark matter 
and is reionized by stellar radiation estimated from semi- 
analytic galaxy formation recipes (Benson et al. 2001 & 

II The fitting formula for the iiieaii differential brightness tem- 
perature is 5Tb = exp[ag -|- ln(i//i/t) -|- ln^(i//i/t)], where 
fc"* = 150 MHz. The same formula will be use in §5 to fit the 
foregrounds contamination. 
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2006). We calculate from the simulation the dark matter 
and gas number density, the mass and number of dark mat- 
ter haloes, and the gas ionization fraction, all as a function 
of redshift. 

The simulation box size is 141.3 h-^Mpc comovmg 
and it contains 256'^ particles, where each particle mass 
is Afp = 2 X 10^" M0. We use a ACDM cosmology with 
fim = 0.3, Q.A = 0.7, h = 0.7, and as = 0.9. The galaxy 
formation model includes Compton cooling, H2 cooling and 
weak feedback from supernova explosions, and the ionizing 
photons escape fraction from galaxies is 0.15 (Benson et al. 
2006). The reader is referred to that paper, and references 
therein, for a full description of the simulation. 



4 FOREGROUNDS 

The 21-cm cosmological signal will suffer from several 
sources of contamination. Here we focus on the contami- 
nation which is produced by foreground radiation emitted 
from galactic and extragalactic sources and add a random 
instrumental noise with a rms value . We have assumed 
that the necessary correction of ionospheric distortion, by 
means of radio adaptive optics, and the calibration of the 
time-variable gain, phase, and polarization of each antenna 
of the radio telescope array, were already taking care of. 
Although the contaminating radiation could be as much as 
10^ times larger than the cosmological signal (e.g. Gnedin 
& Shaver 2004; McQuinn et al. 2006; Morales, Bowman & 
Hewitt 2006), their typical power-law dependence on fre- 
quency should allow us to disentangle the small scale varia- 
tions of the cosmological signal. In the following we describe 
our modeling of the foreground contaminations. 

Two physical processes are responsible for most con- 



taminations: synchrotron radiation and free-free emission. 
Synchrotron radiation arises from acceleration of relativistic 
electrons in magnetic fields. Assuming a power-law depen- 
dence of the electron number density on energy, N{E) ~ 
E where E is the electron energy, one obtains a syn- 
chrotron flux Jsyn ~ with spectral index a = (7 — 
l)/2. The corresponding brightness temperature is Tb oc 
lBy-n{y)i'~'^ OC where /3 = at ~\- 2. We will consider 
synchrotron radiation from our Galaxy, star forming galax- 
ies, radio galaxies and radio haloes and relics in clusters. 
For each of these systems we will assume a power-law syn- 
chrotron emissivity consistent with the respective observa- 
tional constraints (see below). 

The emissivity of the free-free radiation from diffuse 
ionized gas is (Rybicki & Lightman 1979): 

e.("c,re) = 5.4 X 10'^''n^r-'/^ffff(i/,re) 

-hv/k„Ta -3 -1 TT -1 -1 

X e ' B ■= erg cm s Hz sr , 

(3) 

where the Gaunt factor is approximately gs{v,Tc) ~ 
llMT°-^^v-°-^ in the radio regime (Lang 1999), and 
Tc are the electron number density and temperature, and 
I' ~ J'obs(l + z) is the emitted frequency, with i/ohs the ob- 
served frequency. Therefore, the free-free flux density also 
has a power-law dependence on frequency with spectral in- 
dex a = 0.1 (/3 = 2.1). 



4.1 Galactic foregrounds 

The galactic foregrounds are generated by synchrotron, free- 
free, thermal and spinning dust emissions. In the relevant 
frequency range, between ~100 and ~200 MHz, galactic syn- 
chrotron emission is the most dominant foreground, respon- 
sible for ~98.5% of the total galactic contamination, free- free 
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emission from the diffuse ionized hydrogen in the interstel- 
lar medium (ISM) contributes ~1.5% (Shaver et al. 1999), 
and dust emission is negligible (Platania et al. 2003; Reich, 
Testori & Reich 2001). One should also take into account 
radio recombination lines from our Galaxy. Since these lines 
occur at specific frequencies, template spectra can be used 
to remove them, without previous knowledge of their actual 
intensity (e.g. Morales, Bowman & Hewitt 2006). 

The synchrotron emission process is probably due to ra- 
diation from high energy cosmic-ray electrons above a few 
MeV in the galactic magnetic field (e.g. Pacholczyk 1970; 
Banday & Wolfendale 1990, 1991). There is less contami- 
nation in relatively smooth regions away from the galactic 
plane and galactic loops. Following Wang et al. (2006) we 
assume a running power-law in frequency for the galactic 
synchrotron brightness temperature, 



Tsyn — ^syn 



-A/3,y„log(^/^.) 



(4) 



where ^syn is synchrotron brightness temperature at ly, = 
150 MHz, and /3syn and A/3syn are the spectral index and 
spectral running index, respectively. Using the 408 MHz 
all-sky continuum survey of Haslam et al. (1981, 1982), 
Haverkorn, Katgert & de Bruyn (2003) estimated the mean 
brightness temperature at 408 MHz to be ~33 K with tem- 
perature uncertainty of ~10%. After subtraction of the ~2.7 
K contribution of the CMB and the ~3.1 K contribution of 
extragalactic sources (e.g. Bridle 1967; Lawson et al. 1987; 
Reich & Reich 1988), the diffuse synchrotron galactic back- 
ground is ~27.2 K. Assuming a spectral index of 2.74 (Pla- 
tania et al. 2003) we estimate the synchrotron brightness 
temperature at 150 MHz to be ^syn = 442.0 ± 44.2 K. At 
high galactic latitudes, the brightness temperature can drop 
to minimum of ~200 K (Lawson et al. 1987; Reich & Reich 
1988; Shaver et al. 1999). The estimations for the mean spec- 
tral index around 150 MHz are ranged from 2.6 to 2.8 (e.g. 
Bridle 1967; Willis et al. 1977; Lawson et al. 1987; Reich & 
Reich 1988; Banday & Wolfendale 1990, 1991; Tegmark et 
al. 2000; Platania et al. 2003) with indications for disper- 
sion at each position on the sky, due to distinct components 
along the line-of-sight (e.g. Lawson et al. 1987; Reich & Re- 
ich 1988; Banday & Wolfendale 1990, 1991; Shaver et al. 
1999). We choose spectral index of Psyn = 2.7 with disper- 
sion of 0.1 (Reich & Reich 1988; Shaver et al. 1999), and 
spectral running index of A/3syn ~ 0.1 (Tegmark et al. 2000; 
Wang et al. 2006). 

Free-free thermal emission appears in ionized regions in 
the ISM, with electron temperature of Tc > 8000 K. As for 
the galactic synchrotron, we assume a running power-law in 
frequency for the galactic free-free brightness temperature 
(Wang et al. 2006), 



(5) 



where An is the free-free brightness temperature at = 150 
MHz, and /3ff and A/Sg are the spectral index and spectral 
running index, respectively. Maintaining the 70:1 ratio be- 
tween the synchrotron and free-free emission (Shaver et al. 
1999), we adopted Ag = 6.33 ± 0.63 K assuming 10% tem- 
perature uncertainty. At high frequencies (v > 10 GHz) the 
brightness temperature spectral index is /3s = 2.15, while at 
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Figure 2. The Willott et al. (2001) RLF (model C) for ACDM 
cosmology at seven redshifts from ^ = 0.1 to 2 = 6. 



low frequencies it drops to jSg — 2.0 due to optically thick 
self-absorption (Bennett et al. 2003). Since the gas is opti- 
cally thin above few MHz, the brightness temperature spec- 
trum between ~100 and ~200 MHz is well determine with 
a spectral index of = 2.10 ± 0.01 (Shaver et al. 1999), 
and the spectral running index is A/3ff = 0.01 (Tegmark et 
al. 2000; Wang et al. 2006). 



4.2 Extragalactic foregrounds 

The total emission of extragalactic foregrounds has been es- 
timated both directly and from integrated source counts. At 
150 MHz, its contribution to the contamination varies from 
~30 K (Willis et al. 1977; Cane 1979) to ~50 K (Bridle 1967; 
Lawson et al. 1987; Reich & Reich 1988) . These foregrounds 
produce ~10% of the total contamination on average, but 
can reach ~25% at high galactic latitudes, where the mini- 
mum brightness temperature of the diffuse galactic emission 
drops to ~200 K (Lawson et al. 1987; Reich & Reich 1988; 
Shaver et al. 1999). 

Most foregrounds are due to radio point sources and 
relate to active galactic nuclei (AGN) activity. Radio haloes 
and radio relics are also significant foregrounds, but since 
they appear only in rich galaxy clusters, there are rare. The 
remaining extragalactic foregrounds, which are also the less 
significant ones, are synchrotron and free- free emission from 
star forming galaxies, and free-free emission from ionized 
hydrogen dark matter haloes and diffuse IGM. 



4-2.1 Radio emission from AGNs 

Radio emission from AGNs is linked to the accretion of gas 
on massive black holes at the centers of galaxies. In order to 
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incorporate this contamination in the simulation we deter- 
mine the number of AGNs in the simulation box according 
to the radio luminosity function (RLF), and assume that 
the AGN distribution follows the underlying mass density 
field. The RLF is defined as the number of radio galaxies 
and quasars per comoving volume per unit (base 10) loga- 
rithm of luminosity, p{L, z), where L is the radio luminosity 
derived from the fiux-density S and redshift per frequency 
and steradians. 

We use the RLF model of Willott et al. (2001) to calcu- 
late the background fiux density from AGNs, assuming the 
luminosity of an AGN has a power-law dependence on fre- 
quency with a spectral index of 0.80 ±0.15. They used three 
redshift surveys of flux-limited samples selected at the radio 
frequency range of our interest: 7CRS (at 151 MHz with flux 
density Sisi > 0.5 Jy), 3CRR (at 178 MHz with 5178 > 10.9 
Jy), and 6CE (at 151 MHz with 2.0 > 5i5i < 3.93 Jy). 

Willott et al. (2001) separated the radio sources to low- 
and high-luminosity populations. A combination of the low- 
luminosity RLF, /9j, and the high-luminosity RLF, p^^, gives 
the total RLF 



p(L,z) = pj -f pj,. 



(6) 



They checked three models for the redshift distribution. In 
our work we use their redshift distribution which has a one- 
tailed Gaussian rise to the peak redshift and then a one- 
tailed Gaussian decline at higher redshifts (model C) with 
S7m ~ f^A ~ cosmology (for more details see §3 in their 
paper). 



The low-luminosity RLF is 



Pi = 



fto 1- 



exp 



(1 + ^) 



fto(l^) exp(^) (l + z,„)* 



Z < 

Z > Z„ 



(7) 



where log(pjo) = -7.523, log(Li.) = 26.48, Qi = 0.586, h = 
3.48, and z^^ = 0.710. 



The high-luminosity RLF is 



(8) 



where log(pi,J = -6.757, log(Lh.) = 27.39, Qh = 2.42, and 
the high-luminosity evolution function /j^ (2) is 



/h (^) = exp 



for z < 2|^„ , and 



/h (^) = exp 



for z > Zj^g, where z^^ 



2.03, 2i,i = 0.568, and z^^ 



(9) 



(10) 



0.956. 



To convert the RLF with flm = = cosmology, p^, 
to RLF of ACDM cosmology, p^cdm 1 used the following 
relation from Peacock (1985) 



PaCDM i^A 



where V., 



dK, 



dz 



(11) 



and Vq are the comoving volume in ACDM 
and Sim = = cosmologies, respectively. The relation 
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Figure 3. The relic radio luminosity at 1.4 GHz as a function 
of the cluster redshift, where the full circles are the data from 
Giovannini & Feretti (2004), and the solid line is our model best 
fit. 



between the luminosities i^cDM ^^'^ ^0 J^acdm / ^0 



(■Dacdm/-Co) , where 



and Dq are the corresponding 



distances in the two cosmologies (see Peacock 1985; Dunlop 
& Peacock 1990 for more details). In Fig. 2 we present the 
RLF for ACDM cosmology at six redshifts, from 0.1 to 6. 



4.2.2 Radio haloes & relics 

Diffuse non-thermal radio emission from the intra-cluster 
medium (ICM) is observed in about 30 percent of rich galaxy 
clusters, which can be found in dark matter haloes with 
A4aio ^ 5 X 1O^*M0. These sources are classified as radio 
haloes when their morphology is regular, centered on and 
resembling the X-ray emissivity, or radio relics when they are 
irregular, of elongated shapes, and located at the periphery 
of the cluster (e.g. Giovannini & Feretti 2004). For weak 
magnetic fields [B < 0.5pG), the inverse Compton losses are 
dominant, while for stronger magnetic fields, synchrotron 
emission is dominant (e.g. Cassano et al. 2006). 

Cassano et al. (2006) studied the statistics of giant ra- 
dio haloes (GRHs) from electron re-acceleration models. We 
use this study to determine the probability that a halo in 
the simulation hosts a GRH of a given radio emissivity, as 
function of the halo mass and redshift (see their Fig. 17a) . 
The radio halo luminosity at 1.4 GHz is 



LiA GHz = (4.85 ±1.64) X 10^ 



lOi^M, 



erg s 



(12) 



where Afvir is the virial mass of the halo in Mq, and 
= 2.9 ± 0.4. The radio halo luminosity per steradian 
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Table 1. The relevant components of the ISM: the warm neutral 
medium (WNM), the warm ionized medium (WIM), and the hot 
ionized medium (HIM), neglecting the molecular medium (MM) 
and the cold neutral medium (CNM) which axe completely neutral 

and occupy less than 2% of the galaxy voluirie. To is the gas 
temperature, is the hydrogen number density, Xc = ric/n^ 
is the ionization fraction, is the fraction of the interstellar 
volume, and \Z\ is the extent of the gas component normal to the 
disk plane. The following values are based on data from Heiles 
(2001) and Tiillmann (2002). 





Te [10* K] 


"h [cm 




fv 


\Z\ extent [kpc] 


WNM 


0.05-0.8 


0.1-0.5 


0.2 


0.4 


0.5 


WIM 


0.8-1 


0.05-0.08 


1 


0.1 


5 


HIM 


10-1000 


0.003-0.006 


1 


0.5 


7 



at frequency v is 

LlA GHz 



4n 



where = 1.2 (Cassano et al. 2006). 

Radio relics axe probably related to shock waves gener- 
ated by cluster mergers. These shock fronts can revive old 

radio ghosts when the thermal pressure is much higher than 
the magnetic field pressure (Eni31in & Goptal-Krishna 2001; 
Hoeft et al. 2004). Since the available data and knowledge 
on radio relics are still poor, there is no statistical model for 
the occurrence probability and radio luminosity of relics as a 
function of the halo mass and redshift. Therefore, we assume 
a 30% probability for a radio relic to exist in massive dark 
matter haloes, Mhaio ^ 5 x lO^'^M© (e.g. Di Matteo, Cia- 
rdi & Miniati 2004) . Giovannini & Forotti (2004) presented 
data from about 30 clusters of galaxies where at least one 
radio relic is present. We use these data to roughly estimate 
the relic radio luminosity at 1.4 GHz in erg s~^Hz~^ as a 
function of the cluster redshift z (see Fig 3) 

logLi.4 GHz = 7 + (25.48 ± O.27)0° °^«±° °°*erg s'^Hz-^ 



(14) 

The radio relic luminosity per steradians at frequency v is 

Ll.4 GHz 

ITTT^; ergs nz sr , 

(15) 



r / X Li.4 GHz / 1/ \ _i _i _i 



where a^, = 1.2 (Kempner et al. 2004). 



4-2.3 Star forming galaxies 

The gas in the ISM of star forming galaxies is mainly heated 
and ionized by supernovae explosions, stellar winds and cos- 
mic rays. We calculated the free-free emission from the ISM 
components with a significant ionized fraction: the warm 
neutral medium (WNM), the warm ionized medium (WIM), 
and the hot ionized medium (HIM). While neglecting the 
molecular medium (MM) and the cold neutral medium 
(CNM) which are completely neutral and occupy less than 
2% of the galaxy volume (e.g. Heiles 2001; Tiillmann 2002). 



In table 1 we present the physical properties of the rel- 
evant ISM components for free-free emission. We used these 
data and the free-free emissivity from equation (3) to calcu- 
late the luminosity of an ISM component i of a single star 
forming galaxy 

Ll{z) w €^{nl,T^)VZn{z) erg s"^ Hz"^ sr"\ (16) 

where nl = xl is the free electrons number density of the 
component, Tg is the component temperature, and Vj^^^ « 
2wf^r^{z)Z{z) is the component ionized volume, where r{z) 
is the galaxy radius, Z{z) w |Z|[r(«)/ro] is the extent height 
of the galaxy, and ro = 15 kpc is the approximate radius 
of the outer stellar ring of the Milky Way (e.g. Helmi et al. 
2003; Ibata et al. 2003; Yanny et al. 2003). 

We adopt the Shen et al. (2003) galaxy radius distri- 
bution to draw random radii in the sinnilatioii. Shen et al. 
(2003) used a complete set of about 140000 galajcies from the 
Sloan Digital Sky Survey (SDSS) to study the size distribu- 
tion of galaxies as a function of magnitude. They assume a 
log-normal distribution for the galaxy radius 



27rcr,, 



■ exp 



In^ (r/f 



2CT,2 



dr 

5 

r 



(17) 



where the median, f(M), and the dispersion, ainr(M), are 
functions of the r-band Petrosian absolute magnitude, M, 
in the range —24 < M < — 16. For late-type galaxies 



f(M) =[l + 10 
and 

(M) 



-0.4(M-Mo)"| (3-" ^Q-y-OAaM 



kpc. 



(72 -I- 



(18) 



(19) 



1 + 10-0.8{M-^Mo) ' 

where a = 0.26, /3 = 0.51, 7 = -1.71, ai = 0.45, a2 = 0.27, 
and Mo — —20.91 are fitting parameters. Following Blanton 
et al. (2001, 2003), we approximate the galaxy luminosity 
function in the r-band per unit magnitude by a Schechter 
function 

$(M)dM = O.41n(lO)0.1O-°''''^"'''-^("+'^ 

X exp [_io-o-4(A^-M.)j (20) 

where at redshift z = 0.1 the fitting parameters are 0* = 
0.0149 ± 0.0004 h^Mpc"^, M, - 51ogio h = -20.44 ± 0.01, 
and a — — 1.05±0.01. We scale the galaxy radii with redshift 

by r{z) oc H(z)"^ (Ferguson et al. 2004). 

In order to incorporate this contamination in the sim- 
ulation we determine the galaxy number counts for nor- 
mal spiral galaxies using the X-ray luminosity function from 
Ranalli et al. (2005), which based on Takeuchi et al. (2003, 
2004) far-infrared (FIR) luminosity function: 

with ip* 



exp 



~^log?o(l + ^)] (21) 



1.23 ± 0.04, a 



a 

0.724 ±0.01, and L* = (4.4 ±0.9) x 10® h-^'L©. The galaxy 



0.0234 ± 0.0030 h^Mpc'^ 

f h-2 

number counts per comoving volume at redshift z is 

,3 



N{z) 



= d 

^i„i„{^) 



logLv?(logL) h'^Mpc 



(22) 



where the minimum and maximum luminosities at x = are 
imin = 10^^ erg s~^ and Lmax = lO''^ erg s"'^, respectively. 
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We consider luminosity evolution of L oc (1 + zY'"^ (Norman 
et al. 2004; Ranalli et al. 2005). We further assume that 
the galaxy distribution follows the underlying mass density 
field. 

We use the calculation for free-free emission from star 
forming galaxies to estimate their synchrotron emission. 
Shaver et al. (1999) determine a ~70:1 ratio between galactic 
synchrotron and free-free emission at 150 MHz. Therefore, 
we assume that the synchrotron emission is 70 ± 10 times 
greater then the free-free emission at 150 MHz. The spectral 
index of the synchrotron omission depends on the relativis- 
tic electrons density distribution which varies from galaxy to 
galaxy and within each galaxy. Hummel (1991) analyzed the 
low frequency radio continuum data from 27 spiral galaxies 
with deferent galaxy inclinations. He found that the mean 
synchrotron spectral index of a galaxy at low frequencies 
{v < 700 MHz) varies from ~0.2 to ~0.9 with a mean value 
of a = 0.56 ± 0.15. Since our frequency range of interest 
is limited to the vicinity of 150 MHz (100-200 MHz), we 
simplify the frequency dependence by using a mean spectral 
index of a = 0.6 for all galaxies. 



4-2.4 Free-free emission from haloes and IGM 

The free-free emission from haloes comes from ionized gas 
which is confined to dark matter haloes, at the virial tem- 
perature. Photo-ionized diffuse g£is in the IGM, typically, 
has a mean temperature of 10* K. Therefore, only haloes 
with Tvir > 10* K axe able to confine gas to their gravi- 
tational potential well. The virial temperature of a halo of 
mass Mhaio is given by 



the NFW characteristic density, Ps, by p_2 = Ps/4, where 



^- = 3-«>^io'(Took^) ^' 

where the halo circular velocity is 
= [lOOnn.(a)H'(a)(GMhaio)'] '''' . 



(23) 



(24) 



Neglecting temperature variations across the halo, ac- 
cording to equation (3), the total free- free emissivity of the 
halo gas is proportional to the spatial integral of . Assum- 
ing further that the gas density follows that of the dark mat- 
ter, the total emissivity becomes proportional to the vari- 
ance of mass density fluctuations in the halo. To determine 
the density variance of a halo of mass, Mhaio, at redshift ,z, 
we use the improved halo density profile from Navarro ct 
al. (2004), which is based on the Navaxro, Frenk & White 
(1996, 1997, hereafter NFW) density profile 



p(r) 



= exp 



(25) 



where r .^ — Vvir/c is the characteristic radius in which the 
NFW density profile is proportional to r~ , rvir is the virial 
radius of the halo, and c is the concentration parameter. 
To calculate the concentration for arbitrary virial mass and 
redshift, we used the Eke et al. (2001) code available at J. 
F. Navarro's home page**. The density at r_2 is related to 



** A link to the ENS subroutines for calculation of concentra- 



200p, 



(26) 



'^^ 3[ln(l + c)-c/(l + c)]' 
where 

where Ho is the current value of Bubble's constant, and the 
halo mean density is 200p^,;t . Following Navarro et al. (2004) 
we choose a = 0.17. The mass density variance is 



(27) 



47r 2 



exp <^ 

a 



- 1 



r^dr, 



(28) 



where Kir = (47r/3)r 



shift z = 0, pI^JpI,, « 10* for Mhai, 



„2 
Pvar 



is the halo's virial volume. At red- 
10* Mq, and 



/Pi 



0.5 X 10* for Mhaio = 10'^ Mg 



We apply the friends-of-friends (FoF) algorithm to iden- 
tify haloes in the simulation box. Demanding haloes to have 
10 particles or more, gives a mass threshold of A/haio > 
2 X lO'^'^M© for haloes in the simulation. Haloes with lower 
masses are introduced in the simulation according to the 
Sheth & Tormen (1999) mass function. These haloes are 
placed at the positions of simulation particles not belonging 
to haloes identified by the FoF algorithm. We calculate the 
amount of radiation emanating from each cell in the simula- 
tion box by collecting the radiation from all the haloes with 
virial temperatures, Tvir > 10* K. 

The fraction of diffuse ionized gas in the IGM is taken 
to equal the fraction of mass in haloes with Tvir < 10* K. 
The free-free emission from diffuse ionized gas in the IGM 
has a small contribution to the overall contamination. The 
cumulative specific intensity for the diffuse free- free emission 
from the IGM is 



(29) 



r / J ei/(no,Te) _2 _i „ _i 

h = j Ax _^ erg cm s Hz , 

where dx = ca~^dt(z) is the differential conformal distance 
and ei/(ne,Te) is the free- free emissivity from equation (3). 



4.3 Results 

We have implemented the recipes for the foreground contam- 
inations in the simulation box. We work with 65536 lines- 
of-sight in a square window of 141.3 h~ Mpc (comoving) on 
the side and calculate the foregrounds' total intensity in the 
frequency range f = 80 — 240 MHz, corresponding to the 
21-cm signal at redshifts a « 5 — 17. In Fig. 4 we show the 
mean brightness temperature of the various foregrounds. As 
expected the most significant contamination is the galac- 
tic synchrotron foreground (~89.5%), next is the AGN ra- 
dio emission (~ 8.4 — 9.4%), and then the galactic free- free 
(~ 0.9 — 1.8%). The rest of the extragalactic foregrounds 
contribute on average ~0.15%. We also tried several cases 



tions for haloes of ajrbitrary 

mass and redshift can be found at J. F. Navarro's home page: 
http: / /www.astro.uvic.ca/~jfn/mywebpage/home.html 
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Figure 5. Maps of the brightness temperature foregrounds a.t u = 129.78 MHz: radio emission from AGNs (top-left), free-free from 
haloes (top-middle), free-free from diffuse IGM (top-right), synchrotron from star forming galaxies (center-left), emission from radio haloes 
(center-middle) and radio relics (center-right), free- free from star forming galaxies (bottom-left), galactic synchrotron (bottom-middle), 
and galactic free- free (bottom-right). 



where we changed the value of the galactic synchrotron emis- 
sion from a fairly low galactic foreground, which resemble 
high latitude regions (Shaver et al. 1999), where the contri- 
bution of the galactic synchrotron emission drops to ~77% 
of the total contamination (Asyn ~ 150 ± 15 K) and the 
galactic free-free emission adds ~1% {As — 2.250 ± 0.225 
K), to an extremely high galactic foreground, which might 
be found in radio loops (Reich & Reich 1988), where the 
galactic synchrotron emission contributes ~98% of the to- 
tal contamination (^syn = 2000 ± 200 K) and the galactic 
free-free adds ~0.4% {As = 6.33 ± 0.63 K). 

In Fig. 5 we present logarithmic maps of the bright- 
ness temperature of the integrated extragalactic foregrounds 
along the lines-of-sight which contaminate the cosmological 
21-cm signal at z = 10 {u = 129.78 MHz). In Fig. 6 we 
present the same logarithmic maps but for the brightness 
temperature contrast, log (Tb/ (Th)), where (Tb) is the mean 



brightness temperature in each panel. As expected the most 
dominant foreground is the galactic synchrotron (bottom- 
middle) and next is the radio emission from AGN (top-left). 
The free-free emissions from the IGM (top- right), haloes 
(top-middle) and star forming galaxies (bottom-left) con- 
tribute very little to the contamination. The emission from 
radio haloes (center-middle) and relics (center-right), which 
seems to be insignificant on average, can actually contribute 
significant contamination in individual lines-of-sight. Since 
there are only a few haloes with Mhaio ^ 5 x IO^^'Mq in the 
simulation box, for most lines-of-sight the contamination is 
practically zero, while in a few lines-of-sight the brightness 
temperature contamination from radio halo or relic can be 
as high as few 10^ mK. 
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X [Mpc/h] X [Mpc/h] X [Mpc/h] 

Figure 6. The same as Fig. 5, but for the brightness temperature contrast in logarithmic scale, log (Tj,/ (7b)), where (Tj,) is the mean 
brightness temperature in each panel. The three top panels are scaled from -0.3 to 0.35. The three central panels and the bottom-left 
panel are scaled from -0.8 to 2.8. The bottom-middle and right panels are scaled from -0.025 to 0.025. 



5 SIGNAL RECONSTRUCTION 

The MAP formalism have been used to reconstruct the 
large-scale structure of the Universe (e.g. Rybicki & Press 
1992; Fisher et al. 1995; Zaroubi et al. 1995). Based on 
this formalism, we develop a method to reconstruct the 21- 
cm cosmological signal from the contaminated data. The 
method relies on the smoothness of the contaminating radi- 
ation along the frequency axis and an assumed prior for the 
correlation properties of the cosmological signal. According 
to Bayes theorem the conditional probability distribution 
function for the occurrence of a 3D map, Tc, of a 21-cm 
cosmological signal given an observed data To is 



P(Tc|To) = 



P(TojTc)P(Tc) 



cx exp i - i ^ ^ + NJ^.^Tr" 



+ 



(Tc - WTr)" [S- 



N" 



(30) 

„,„ (Tc-WTr)"]}, 



where u and v are position vector indices, and W = 
S[S -I- N]~^ is the Wiener fiher. We assume that the sig- 
nal is approximately a Gaussian random field with an auto- 
correlation function S and that the (uncorrelated) noise is 
Gaussian with a diagonal correlation function N which is 
N„.„ — for u ^ V and N„.„ — for u = v where is 
the noise rms value assumed to be constant (for more details 
see Appendix Al). We determine Tr as the residue of the 
observed 3D signal. To, after subtracting the approximated 
foregrounds. The ith element along the line-of-sight I of Tr 



exp 




ln(^ 



(31) 



where we used a polynomial fitting in Xniv^jv^') of order 
= 2 for the foregrounds, and (Vi/i't) is the ratio be- 
tween the frequency in the ith bin and = 150 MHz. The 
coefficients of the polynomial, 6, ^ , are determined by min- 
imization of the first term of the exponent in the rhs of the 
conditional probability distribution function, equation (30), 
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Figure 4. The mean brightness temperature of the different fore- 
grounds as a function of frequency. The total galactic and extra- 
galactic foregrounds (thick solid line), the galactic synchrotron 
emission (thick dotted line), and the galactic free- free emission 
(thick dashed line), the radio emission from AGNs (thin long- 
dashed line), radio haloes (thin short-dash dotted line), and radio 
relics (thin long-dash dotted line), the synchrotron (thin long- 
dashed short-dashed line) and free-free emissions (thin short- 
dashed line) from star forming galaxies, and the free-free emission 
from haloes (thin dotted line), and the diffuse IGM (thin solid 
line). 



either in real space 
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Figure 7. The mean drms for all the lines-of-sight in redshift 
space as a function of the instrumental noise. The full circles 
are for nine values of instrumental noise, a^^ = 1, 5, fO, 20, 50, 
too, 150, 200, and 250 mK, with r] = I. The solid line is the 
best linear fit for these data points in logarithmic scale, (dims) = 
2.741no-„ +0.78. 



(32) 



or in fc- space 



2 ^ Vk 

k 



(33) 



where 7^,fe is the k vector element of the 3D Fourier trans- 
form of Tr, and Vk is the 3D power spectrum of the bright- 
ness temperature field. To find the best set of 6, ^ coeffi- 
cients, we minimize Q by an iterative steepest descent algo- 
rithm using the Newton direction as the descent direction 



(34) 



where VQ and V Q are the gradient and the Hessian of Q, 
respectively (for more details see Appendix A2). Then by 
solving the equation dP/dTc — 0, one gets the "optimal" 
signal 

Tc = WTr = S[S + N]"^Tr, (35) 
or in fc-space, dP/dTc,k ~ 0, 



Tc.k 



%-,k , 



(36) 



where Tc^k is the k element of the 3D Fourier transform of 
Tc, Vk = O^k)^. is the one-dimensional (ID) mean power 
spectrum, where k — \k\, and the Wiener filter parameter 77 
determines the strength of the Wiener filter. According to 
the Wiener filter formalism rj — 1, but in this case one does 
not recover all the small scale power since 

/ -v. \^ 

^rcc 
rk 



Vk 



Vk + ol 



{Vk + crl) = 



{(Tr^kf) 

Vk 



Tk + ol 



Vk. 



(37) 



Using 77 = 0.5 eliminates the power suppression at small 
scales and Vk^"^ ~ Vk, but slightly spoils the reconstruction 
results for each individual line-of-sight. 



5.1 Results 

We assess the ability of the method at reconstructing the 
cosmological signal using the simulation output around red- 
shift 2 = 10 in the frequency interval v = 124.8-136.0 MHz 
in 256 bins, and for 65536 lines-of-sight. Around this redshift 
the mean ionization fraction is /ion ~ 0.38. The galactic 
and extragalactic foregrounds are introduced as described 
in §4. We also add an uncorrelated instrumental random 
noise component, a^, and consider five values from CTj^ = 1 
to 250 mK. The method is then applied on noisy and con- 
taminated 3D brightness temperature maps generated from 
the simulation for redshift space^^. 

To estimate the accuracy of the signal cleaning process 
we calculated the root-mean-square (rms) of the difference 
between the true signal from the simulation, Tt^ue) a^nd the 
reconstructed signal from the observed data, T^ec, for all the 
points along each line-of-sight, I, 



true rec J / • 



(38) 



tt In redshift space the peculiar velocity (deviation from a pure 
Hubble flow) is taken into account in the relation between the 
emission and observed frequency. 
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Figure 8. A single line-of-sight signal reconstruction in redshift space for = 1 mK and drms = 1-77 (left panels) and (T^ = 5 mK and 
drms = 4.50 (right panels). Top panels: the brightness temperature, <5Tb, of the true cosmological signal (solid line) and the recovered 
signal (dashed line). Bottom panels: the residue, AT]-,, between the true and recovered signals (solid line) and between the true and the 
fitted foregrounds (dashed line). For all cases r] = 0.5. 



Table 2. The mean rms difference, drms, between the true cos- 
mological signal from the simulation and the reconstructed signal 
from the observed data for all lines-of-sight in redshift space. We 
present the results for five values of instrumental noise where the 
power index of the reconstructed signal is either ri = 0.5 or r; = 1. 





r) = 0.5 


V = 1.0 


(T„ [mK] 




(f^rms) 


1 


1.78 ±0.57 


1.78 ± 0.57 


5 


4.48 ± 0.34 


4.27 ±0.39 


20 


10.23 ± 0.62 


8.34 ± 0.77 


100 


20.17 ± 1.71 


13.54 ± 1.61 


250 


27.04 ±3.17 


15.62 ± 2.71 



In Fig. 7 we present the mean drms for all the lines- 
of-sight at redshift space as a function of the instrumental 
noise. We calculated (drms) nine values of noise from = 1 
to 250 mK (full circles). We also found the best linear fit 
in logarithmic scale, (drms) ~ 2.741n(Tpj +0.78. Since (drms) 
has a linear dependence in Ino-j^, we expect the results of 
the signal reconstruction process in higher noise levels to be 
similar to the results in the case of cr^ = 250 mK. 

In table 2 we present the calculation of the mean drms 
for all the lines-of-sight, for five values of instrumental noise 
= 1, 5, 20, 100 and 250 mK, and for tj = 0.5 and 1. 



As expected, rj — 1 gives better (drms) than 77 = 0.5 and 
the differences become more significant as the instrumental 
noise increases. In the case of CTj^ = 1 mK, (drms) is higher 
then the noise level. This happened because we did not take 
into account the errors in the foreground fitting which has 
an rms value is of the order of 2 mK. The foreground error 
rms stays approximately 2 mK also for higher values of cr„ 
and therefore becomes less significant as the instrumental 
noise increases. 

In Fig. 8 we present an example of a recovered cosmo- 
logical 21-cm signal compared with the true signal along 
a line-of-sight and for the noise levels: = 1 mK (top- 
left panel, drms ~ 1.77) and cr„ = 5 mK (top-right panel, 
drms = 4.50). The residue between the true and the recov- 
ered signals is [ATbl ^ 5 mK for — 1 mK (bottom- left 
panel) and jATbl < 12 mK for — 5 mK (bottom-right 
panel). In Fig. 9 we present results for one of the best recov- 
ered lines-of-sight, where the rms difference is drms = 0.93 
for ffpj = 1 mK and drms = 3.62 for = 5 mK. The sig- 
nal residue in this case is lATbl ^ 2 mK for — 1 mK 
(bottom-left panel) and lATbl < 10 mK for = 5 mK 
(bottom-right panel). For all cases in Figs. 8 and 9, rj = 0.5. 
Since the cosmological signal in the simulation box has a 
mean value of 5Tb = 10.15 ± 8.61 mK, there was no point 
to present the recovered signal for a single line-of-sight for 
higher values of ct„ . 
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Figure 9. Same as Fig. 8, but for one of the best reconstructions, where the rms difference is drms — 0.93 for cr-^ — 1 iiiK cind (irms — 

3.62 

for (T„ = 5 mK. 



We also compute the reconstructed signals for our range 
of low and high galactic foregrounds. We calculate the mean 
drms for these cases and found no significant differences to 
our "optimal" case. 



5.1.1 The power spectrum 

In Fig. 10 we compare the dimensionless power spectrum, 
A^(fe) = (F/87r^)47rfc^Pfc, of the brightness temperature, 
5Tb, between the true cosmological signal from the simu- 
lation (thin solid line), and the reconstructed signal with: 
cr„ = 1 mK (dotted hne) , (7^^ = 5 mK (dashed line) , cr„ = 20 
mK (long-dashed line), cr^ = 100 mK (dash-dotted line), 
and (Tpj = 250 mK (long-dash-dotted line), both for r; = 1 
(thick lines), and 77 = 0.5 (thin lines). As expected from 
equation (37), for 77 = 1 the power is suppressed at small 
scales (large wavenumbers k), while r] = 0.5 gives excellent 
fit for all a^. At large scales, where log(fc) < — 0.4, the size 
of the simulation box limited the ability to get sufficient 
statistics in the foreground fitting process. Therefore, the 
fitted foregrounds include some of the cosmological signal. 
In Fig. 11 we present a comparison of the true cosmologi- 
cal signal after applying a low-pass filter in fc-space, where 
fe < 0.1 h Mpc^^, with the differences between the true and 
the fitted foregrounds after applying the same low-pass filter, 
left and middle panels, respectively. One can see the similar- 
ity between the two maps which means that for large scales 
a significant part of the cosmological signal is included in the 
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Figure 10. The dimensionless power spectrum, A^(fc), of the 
brightness temperature, in redshift space, for the cosmolog- 
ical signal from the simulation (thin solid line), and the recon- 
structed signal with: = 1 mK (dotted line), a-^ = b mK 
(dashed line), Cj^ = 20 mK (long-dashed line), a-^ = 100 mK 
(dash-dotted line), and a-^ = 250 mK (long-dash-dotted line), 
where both r) = 1 (thick lines), and ri = 0.5 (thin lines) were 
used. 
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Figure 11. The left panel shows a 2D slice of the redshift space 3D map of the true 21-cm cosmological signal from the simulation 
ATb = (5Tt, — (<5Tb) at 2 ^ 10 after applying a low-pass filter in fc-space, where k < 0.1 h Mpc~^. The middle panel shows the differences 
between the true foregrounds and the fitted foregrounds for the same slice and after applying the same low-pass filter. The difference 
between the left and middle panels is drawn in the right panel. The thick solid contour indicates ATb = mK, while the thin solid and 
dotted contours, respectively, represent the brightness temperatures above and below zero. The contour spacing is 0.2 mK, the 2D slice 
is perpendicular to the lines-of-sight, and ctj^ = 1 mK 



fitted foregrounds. To overcome this problem one should use 
a larger simulation box. 

In Fig. 12 we present (5Tb the power spectrum in log- 
arithmic scale in the k± — fcy plane, where k± and fey are 
the wavenumbers perpendicular and parallel to the lines-of- 
sight, respectively. The power spectrum of the cosmologi- 
cal signal from the simulation in redshift space is presented 
in the top-left panel, while the rest of the panels show the 
power spectrum of the reconstructed signal from the ob- 
served data for five values of cr^: 1 mK (top-middle), 5 mK 
(top-right), 20 mK (bottom-left), 100 mK (bottom-middle), 
and 250 mK (bottom-right), where rj — 0.5 for all cases. One 
can see that as the noise level grows the power spectrum of 
the reconstructed signal becomes less and less reliable, espe- 
cially for large k± (small scales), where the high instrumen- 
tal noise eliminates the differences between large and small 

Nusser 2005b suggested to apply the Alcock-Paczyhski 
test on the observed maps of the cosmological signal (cf. 
Pandey & Bhardwaj 2005; Barkana & Loeb 2005). To do 
so one must compare the power spectrum in redshift space 
with power spectrum in real space. Figs. 13 and 14 are the 
same as Figs. 10 and 12, respectively, but for real space. 
The dimensionless power spectrum, A^(fc), is slightly lower 
in real space, and the two-dimensional (2D) power spectrum 
is symmetric in real space and asymmetric in redshift space. 
We see that a successful application of this test requires 
noise levels significantly lower than 100 mK. 

We also present (Fig. 15) the ratio, /p(k) = 
Piec{k) / Psig{k) — 1, in the k± — fey plane, between the power 
spectrum of the true cosmological signal, Paig(fc), and the 
power spectrum of the reconstructed signal, Proc(fc), where 
the top and bottom panels are for redshift and real space, re- 
spectively. We present the results for the same five values of 




-1 -0.5 0.5 1 
log[k (h Mpc-')] 



Figure 13. The same as Fig. 10 but in real space. 

a^: 1, 5, 20, 100, and 250 mK in five panels from left to right, 
respectively. For low instrumental noise of = 1 mK the 
results are reliable with no significant differences on average 
between the true and reconstructed signals and a variance 
of ~ 1% for log(fc) < 0.5 [h Mpc"^], both in real and red- 
shift space. The results for ct„ = 5 mK are still reasonably 
good with differences between the true and reconstructed 
signals of 5.0 ± 8.7% in redshift space and 8.7 ± 12.4% in 
real space, both for log(fc) < 0.5 [h Mpc~^]. As the instru- 
mental noise increases the differences between the true and 
reconstructed signals increases and the results becomes less 
and less reliable (for more details see table 3). 
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Figure 12. The power spectrum in logarithmic scale of the brightness temperature, ST^,, in the k± — fcy plane, where k± and A;|| are the 
wavenumbers perpendicular and the parallel to the lines-of-sight, respectively. The power spectrum of the cosmological signal from the 
simulation is presented in the top-left panel, while the rest of the panels show the power spectrum of the reconstructed signal from the 
observed data for five values of <Tj^: 1 mK (top-middle), 5 mK (top-right), 20 mK (bottom- left), 100 mK (bottom- middle), and 250 mK 
(bottom-right). All cases are in redshift space, and r] = 0.5 was used to reconstruct the signals. 

Table 3. The mean differences (in percentages) between the true cosmological signal from the simulation and the reconstructed signal 
from the observed data for five values of and three different maximal log(fc), both in redshift space and real space. The Wiener filter 
parameter is ?j = 0.5 for all cases. 



Redshift space Real space 

log(fc) [h Mpc-l] 

(T„ [mK] <: 0.03 ^ 0.21 ^ 0.51 <: 0.03 ^ 0.21 ^ 0.51 



1 
5 

20 
100 
250 



-0.4 ±0.9 
0.2 ± 1.4 
9.4 ±8.7 
180.5 ± 159.0 
574.9 ± 640.3 



-0.3 ±0.8 
0.8 ± 1.7 
17.6 ± 18.7 

291.1 ±345.0 

881.2 ± 1336.1 



0.0 ±0.8 
5.0 ±8.7 
71.7 ± 114.5 
912.3 ± 1797.4 
2627.0 ±6452.1 



-0.1 ± 1.2 
0.8 ± 1.9 

17.1 ± 11.6 
254.2 ± 143.1 
620.4 ±500.5 



-0.1 ± 1.0 
1.8 ±2.2 

29.3 ±21.3 
350.4 ±300.0 
821.7 ± 1018.4 



0.3 ± 1.0 
8.7 ± 12.4 
100.7 ± 124.7 
885.6 ± 1568.6 
2060.9 ±4841.7 



5.1.2 The Minkowski functionals 

The morphology of the total gas and the neutral hydrogen 
distributions during the cosmological epoch of reionization 
can be quantified with Minkowski functionals (MFs) of iso- 
density surfaces. Gleser et al. (2006) suggested the MFs as 
a means of investigating the reionization epoch. In Figs. 16 
and 17 we present the MFs from the 21-cm cosmological 
signal in redshift and real space, respectively, where Vo(i') 
is the volume of regions with brightness temperature above 
the threshold Tthr, Vi(v) is a measure of the surface area of 
the boundary, V2{i>) is the mean curvature over the surface. 



and Vziv) is the Euler characteristic, x- We compare the 
MFs of the true cosmological signal brightness temperature 
(thin solid line), and the reconstructed signal with: cTj^ = 1 
mK (dotted line), (t„ = 5 mK (dashed line), (t„ = 20 mK 
(long-dashed line), o-^ — 100 mK (dash-dotted line), and 
= 250 mK (long-dash-dotted line), both for rj = 1 (thick 
lines), and r; = 0.5 (thin lines). Since the MFs are sensi- 
tive to the residual noise and foreground contamination, we 
smooth the 21-cm maps with a Gaussian filter of width 0.55 
h~^Mpc before applying the MFs computing code to them. 

A comparison of the Vo curves in the figures shows that 
the reconstructed maps have fewer regions with high bright- 





-0.5 0.5 -0.5 0.5 -0.5 0.5 -0.5 0.5 -0.5 0.5 

log(k [h Mpc"^]) 



Figure 15. The ratio, /pj^^ = Piccik) / Psig{k) — 1, between tlie recovered and true signal power spectrum in redsliift space (top panels) 
and real space (bottom panels). Tlie results for five values of cr^ : 1, 5, 20, 100, and 250 mK are presented in five panels from left to right, 
respectively. For all cases rj = 0.5. 



ness temperature than the true cosmological signal This 
effect increases with the noise rms . Both in real and red- 
shift space, rj = 0.5 perform better than 77 = 1 at recovering 
the true MFs. In any case, the recovery of the MPs is rea- 
sonable for — 1 and 5 mK, suggesting that in the future, 
when the noise level will be reduced to 0-^, « 5 mK, the 
MFs could be used to distinguish between different stages 



of ionization and between different reionization scenarios as 
advocated by Gleser et al. (2006). 
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Figure 16. The Minkowski functionals in redshift space of the 
true cosmological signal brightness temperature (thin sohd line), 
and the reconstructed signal with: = 1 mK (dotted hne), 
fT], = 5 mK (dashed Hne), tr^ = 20 mK (long-dashed line), cr^ = 
100 mK (dash-dotted line), and a-^ = 250 mK (long-dash-dotted 
line), where both r) = 1 (thick lines), and r) = 0.5 (thin lines) were 
used. 




- 7J=1.0 

- 7,=0.5 




0.0 0.2 0.4 
log(Tthr/<'5Tb>) 



-0.2 0.0 0.2 0.4 
log(Ty,y<tfTj,>) 
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log(T,„y<(5T,>) 



Figure 17. The same as Fig. 16 but in real space. 



6 DISCUSSION 

Maps of 21-cm radiation from diffuse H I before and dur- 
ing the epoch of reionization are potentially our only direct 
probe of the Universe in the dark ages. From this point of 
view the efforts invested in modeling this radiation and in at- 
tempts to observe it are worthwhile. Redshifted 21-cm maps 
are likely to constrain the early stages of star and galaxy for- 
mation and also the state of the diffuse H I before any light 
emitting object had formed. Decoupling of the H I spin tem- 
perature, Ts, from the CMB temperature, T^^-^ , is necessary 
for observing redshifted 21-cm (either in absorption or in 



emission). In the absence of any heating mechanisms, the 
kinetic gas temperature, Tk, drops below T^^^^ at redshifts 
z < 100. For z > 20 the Universe is dense enough so that 
atom-atom collisions are still efficient at bringing Ts down 
towards Tk. However, even in the absence of luminous struc- 
ture, the gas temperature could be raised above the CMB if 
the dark matter could decay or annihilate (e.g. Ripamonti, 
Mapelli & Ferrara 2007). For example, neutralinos, if they 
are the dark matter particles, produce sufficient electron- 
positron e~e^pairs by self-annihilations (Myers & Nusser 
2007). High energy photons generated by inverse Compton 
scattering of CMB photons, could then photo-heat the gas 
to kinetic temperatures above T^^^- In this case, the gas 
is never seen in absorption. Therefore, 21-cm maps could in 
principle also constrain the nature of the dark matter par- 
ticle. 

The 21-cm cosmological signal will suffer from several 
sources of contamination. In the first part of this work 
we model the contamination which is produced by fore- 
ground radiation emitted from galactic and extragalactic 
sources, and produced 3D contamination maps. The fore- 
grounds model includes synchrotron and free-free emission 
from our Galaxy and seven extragalactic foregrounds: ra- 
dio point sources which relate to AGN activity, radio haloes 
and radio relics from massive dark matter haloes with mass 
of Afhaio ^ 5 X IO^^Mq, synchrotron and free-free emission 
from star forming galaxies, and free-free emission from ion- 
ized hydrogen dark matter haloes and diffuse IGM. The 
most significant foreground is the galactic synchrotron which 
cause ~90% of the contamination on average. The galactic 
free-free contributes another 1-2% to the total contamina- 
tion. The AGN radio point sources produce ~10% of the to- 
tal contamination on average, but can reach ~25% at high 
galactic latitudes, where the minimum brightness tempera- 
ture of the diffuse galactic emission drops to ~200 K. Radio 
haloes and radio relics are also significant foregrounds, but 
since they appear only in rich galaxy clusters, they are rare 
and should appear only as spots in individual lines-of-sight. 
The remaining extragalactic foregrounds are less significant 
and contribute less the 1% to the total contamination. 

In the second part we add the 3D contamination maps 
and an instrumental random noise to the maps of the cos- 
mological 21-cm signal, and apply a reconstruction method 
based on a Bayesian statistical approach. We take advan- 
tage of the foregrounds' smooth dependence on frequency 
in contrast to the "noisy" signal. Then we derive an appro- 
priate Wiener filter to extract the cosmological signal from 
the residual brightness temperature obtained after subtract- 
ing the approximated foregrounds from the observed data. 
We also introduce a power index rj which determines the 
strength of the Wiener filter. The Wiener filter obtained di- 
rectly from Bayesian statistics has rj — 1, but using rj — 0.5 
eliminates the power suppression at small scales. For low in- 
strumental noise we were able to get reasonable reconstruc- 
tion of the cosmological signal for each line-of-sight, where 
drms « 1.7 ± 0.6 for o-„ = 1 mK, and d^ms « 4.2 ± 0.4 for 
= 5 mK. We checked the reconstruction of the statistical 
measurements of the power spectrum and the MFs. The ID 
power spectrum was nicely reconstructed for all values of 
the instrumental noise up to 250 mK, while the 2D power 
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spectrum and the MFs were reconstructed reasonably well 
only for noise levels significantly lower than 100 mK. 
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APPENDIX A: THE METHOD OF SIGNAL 
RECONSTRUCTION 

The maximum a-posteriori probability (MAP) formalism 
have been used to reconstruct the large-scale structure of 
the Universe (e.g. Rybicki & Press 1992; Zaroubi et al. 1995; 
Fisher et al. 1995). Based on this formalism, we develop a 
method to reconstruct the weak 21-cm cosmological signal 
from the contaminated noisy data. The method relies on the 
smoothness of the contaminating radiation along the fre- 
quency axis and an assumed prior for the correlation prop- 
erties of the cosmological signal. 



Al The conditional probability 

Let P{s\d) be the conditional probability distribution func- 
tion to get the signal vector s given the data vector d. The 
MAP method estimates s by maximization of P{s\d). Ac- 
cording to Bayes theorem 



Pis\d) = 



P{d\s)P{s) 
Pid) ■ 



(Al) 



Since the maximization is on s, the denominator P{d) can 
be ignored. In addition to the true signal, the data d con- 
tains contamination from different foreground sources and a 
random (instrumental) noise. Here c represents the fit to the 
total contamination. The ith. element of the contamination 
along a line-of-sight I is 



exp 




(A2) 



where we use a polynomial approximation in a;, . = 
ln(j/, ■/ly,) of order . is the frequency at position i 

along the line-of-sight Z, is a constant frequency, and 
6,^ are the polynomial coefficients. W Usually, the frequency 
. = v. is the same for all Z's, and therefore, the vector x 
contains n, times the vector x. = ln(!^^/i^, ), where n, is the 
number of lines-of-sight. We determine the residue vector 
y = d — c as the residual data d after subtraction of the 
approximated contaminations c. So ideally, y contains only 
the true signal and the random noise. 

Assuming that the signal is approximately a Gaussian 
random field with an auto-correlation matrix S, the proba- 
bility function of s is 



P(s) = exp(-isTs-is). 



(A3) 



Further, assuming a Gaussian noise matrix N, the probabil- 
ity for d giving s is 



P{d\s) = exp 



l{y - sfN-\y - s) 



(A4) 



In the following we assume N = 1(7^, , where is the noise 
rms and I is the identity matrix. 



W We use two indices I and i to determine the position of each 
element in the vectors c and x. One can replace I and i with a 
single index j where j = n^{l — 1) + i and n- is the number of 
frequencies along a single line-of-sight. 



Using the above probability functions (equations (A3) 
& (A4)), the probability P{s\d) is proportional to 

P(s|d) oc P{d\s)P{s) 

= {-^ [iv - sf'^~\y - s) + s'^S-'s] } 
= exp {-i [y^N-^y + s^(S-' + N-')s 

-y'^N'^s - s'^N'^y]} , 
= exp {-i [2/^(S + Ny'y + s^(S-^ + N-')s 

-hy^N-iS(S + N)-'y 
- ^"^8(8 + N)"^(8"^ + N'^)s 
-s^(S-^-f N-^)S(S + N)-^y]}, 



(A5) 



where we use the matrix identity 



N" 



: N-i(S -I- N)(S -I- N)-i = (N-^S + I)(S + N)-i 



= N-^S(S-hN)-^-|-(S-|-N)-^ (A6) 
for the first term in the rhs of the equation, and 
(S"^ + N"^)-^ = S(S -I- N)-^N = N(S + N)-^S (A7) 

for the third and the fourth terms in the rhs of the equation. 

Next we again use the identity in equation (A7) for the third 
term in the rhs of equation (A5), and get 

P{s\d) oc exp {-^ [^"^(8 + N)-iy + s'^iS'^ + N-^)s 

+ y'^8(8 + N)-i(8-' + N-i)8(8 + N)-iy 

-y'^8(8-hN)-i(8-i + N-')s 
-s^(S-^ +N-^)8(8 + N)-"^y]} 

= exp{-l [y'^{S + N)-'y + s'^{S-' +N-')s 

+ (S(S + N)-'y)'^ {S'' + N-') (8(8 + N)" 
- (8(8 + N)-iy)'^(8"^ +N-^)s 
-s^(8-^+N-^)(8(8 + N)-^y)]} 

= exp {-i [yT(8 + N)-iy + {s - 8(8 + Ny'y)^ 
x(8-^+N-i)(s-8(8 + N)-iy)]}, 



A2 Fitting the contamination 

The fitting process of the total contamination, c, depends on 
two assumptions: (i) the signal is much weaker then the to- 
tal contamination, and (ii) the contamination has a smooth 
frequency spectrum while the signal is rapidly changing in 
frequency. Once these assumptions are valid, we can switch 
between the signal and the contamination, and treat the 
contamination as the desirable signal and the signal as an 
additional random noise. Now, the polynomial coefHcients, 
6,^, can be calculated by minimization of the first term of 
the exponent at the rhs of the conditional probability dis- 
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tribution function (equation (AS)) 



^ (A9a) 



i=l i=l l' = l i'=l 



dq 
aft,™ 



n . n, n • 



E E E c^'.. + N)- ('^..'-c.,,' ) ' 



(A9b) 



n . n. 



EEK'=<.^(s+N)-^,,,a.::^,c,,, 

i=l 

-5,-.<r"'=M(S + N)ri'y(d,,.,-c,..,)] 

(A9c) 



where i and i' arc the frequency position indices, from 1 to 
, along the lines-of-sight I and I' , respectively. The number 
of lines-of-sight is n, , and 5^_^, is the Kronecker delta which 
equal one for I = and zero otherwise. Equations (A9b) 
and (A9c) are the elements of the gradient and Hessian, 
respectively, where the are + 1 coefficients, b,^ , for each 
line-of-sight. 

The minimization can be calculated also in Fourier- 
space 



_ 1 \ ^ (Pfc — Cfc) 



k 



(AlOa) 
(AlOb) 



E 



-S, 



(z^)-^fc (^„) 

Tu (g„+„) (Pfc -Cfc) 



(AlOc) 

where 3^*, , 'Dfc , and Ck are the fe elements of the Fourier trans- 
form of y, d. and c, respectively, Vk is the field power spec- 
trum, and Tk (-Sm) is the k element of the Fourier transform 
of z„ , where the Z", i element of is equal to a^, ^c^,, ^ for 
I" = I and zero otherwise. 



A3 The reconstructed signal 

To get the optimal reconstructed signal, s, one should 
minimize the conditional probability distribution function 
dP{s\y)/ds = 0, 

s = S(S-hN)-iy, (All) 

or in Fourier-space, dP/dS = 0, 

Vi. 

where Sk is the k element of the Fourier transform of s, and 
Vk/iVk IS the Wiener filter. 



